Disambiguation of Finite-State Transducers
Abstract
The objective of this work is to disambiguate transducers of the form T = R ◦ D so that the determinization algorithm described in (Mohri, 1997) can be applied to them. Our approach consists of first computing the composition T and then disambiguating T itself. An important consequence of this result is that any number of transducers R can be composed with the transducer D. This contrasts with the previous approach, which first disambiguated the transducers D and R to produce D′ and R′, respectively, and then computed T′ = R′ ◦ D′, where T′ is unambiguous. We present results for the case where the transducer D represents a dictionary and R represents phonological rules.
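To make the two notions in the abstract concrete, here is a minimal Python sketch of composing two epsilon-free transducers and counting the accepting paths for one input. It is not the disambiguation or determinization algorithm of the paper. The `FST` tuple, the `compose` and `outputs` helpers, and the toy `lexicon` and `rules` machines are all hypothetical, and the sketch assumes that the output of the first operand feeds the input of the second, a convention the abstract does not specify.

```python
from collections import namedtuple

# Hypothetical minimal representation (an assumption, not the paper's):
# an epsilon-free transducer with one start state.
# arcs is a list of (source, input_symbol, output_symbol, destination).
FST = namedtuple("FST", "start finals arcs")


def compose(first, second):
    """Compose two epsilon-free transducers.

    Assumed convention: the output of `first` is matched against
    the input of `second`.
    """
    start = (first.start, second.start)
    arcs, finals = [], set()
    seen, stack = {start}, [start]
    while stack:
        p, q = stack.pop()
        if p in first.finals and q in second.finals:
            finals.add((p, q))
        for (s1, a, b, d1) in first.arcs:
            if s1 != p:
                continue
            for (s2, b2, c, d2) in second.arcs:
                if s2 == q and b2 == b:
                    dst = (d1, d2)
                    arcs.append(((p, q), a, c, dst))
                    if dst not in seen:
                        seen.add(dst)
                        stack.append(dst)
    return FST(start, finals, arcs)


def outputs(t, word):
    """Return the output string of every accepting path whose input is `word`."""
    results = []

    def walk(state, i, out):
        if i == len(word):
            if state in t.finals:
                results.append(out)
            return
        for (s, a, b, d) in t.arcs:
            if s == state and a == word[i]:
                walk(d, i + 1, out + b)

    walk(t.start, 0, "")
    return results


# Toy machines (illustrative only): the lexicon offers two analyses of "a",
# and the rules map both analyses to the same surface symbol "z".
lexicon = FST(0, {1, 2}, [(0, "a", "x", 1), (0, "a", "y", 2)])
rules = FST(0, {0}, [(0, "x", "z", 0), (0, "y", "z", 0)])

composed = compose(lexicon, rules)
paths = outputs(composed, "a")
```

In this toy case `paths` holds one entry per accepting path, so more than one entry for the same input means the composed machine is ambiguous on that input, which is the property a disambiguation step would have to remove.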
Similar resources
Use of Weighted Finite State Transducers in Part of Speech
This paper addresses issues in part of speech disambiguation using finite-state transducers and presents two main contributions to the field. One of them is the use of finite-state machines for part of speech tagging. Linguistic and statistical information is represented in terms of weights on transitions in weighted finite-state transducers. Another contribution is the successful combination of techni...
A Disambiguation Algorithm for Finite Automata and Functional Transducers
We present a new disambiguation algorithm for finite automata and functional finite-state transducers. We give a full description of the algorithm, including a detailed pseudocode and analysis, and several illustrating examples. Our algorithm is often more efficient and the result dramatically smaller than the one obtained using determinization for finite automata or an existing disambiguation ...
Use of Weighted Finite State Transducers in Part of Speech Tagging
This paper addresses issues in part of speech disambiguation using finite-state transducers and presents two main contributions to the field. One of them is the use of finite-state machines for part of speech tagging. Linguistic and statistical information is represented in terms of weights on transitions in weighted finite-state transducers. Another contribution is the successful combination o...
Implementing Voting Constraints With Finite State Transducers
We describe a constraint-based morphological disambiguation system in which individual constraint rules vote on matching morphological parses followed by its implementation using finite state transducers. Voting constraint rules have a number of desirable properties: The outcome of the disambiguation is independent of the order of application of the local contextual constraint rules. Thus the r...
Morphological Disambiguation by Voting Constraints
We present a constraint-based morphological disambiguation system in which individual constraints vote on matching morphological parses, and disambiguation of all the tokens in a sentence is performed at the end by selecting parses that receive the highest votes. This constraint application paradigm makes the outcome of the disambiguation independent of the rule sequence, and hence relieves the...
Utterance Classification In AutoTutor
This paper describes classification of typed student utterances within AutoTutor, an intelligent tutoring system. Utterances are classified to one of 18 categories, including 16 question categories. The classifier presented uses part of speech tagging, cascaded finite state transducers, and simple disambiguation rules. Shallow NLP is well suited to the task: session log file analysis reveals si...